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Application/Control Number: 09/813,71 1 Page 2 

Art Unit: 2655 

DETAILED ACTION 
Claim Rejections - 35 USC § 102 

1 . The following is a quotation of the appropriate paragraphs of 35 

U.S.C. 1 02 that form the basis for the rejections under this section made in this 

Office action: 

A person shall be entitled to a patent unless - 

(e) the invention was described in (1) an application for patent, published under section 
122(b), by another filed in the United States before the invention by the applicant for patent or 
(2) a patent granted on an application for patent by another filed in the United States before 
the invention by the applicant for patent, except that an international application filed under 
the treaty defined in section 351(a) shall have the effects for purposes of this subsection of an 
application filed in the United States only if the international application designated the United 
states and was published under Article 21(2) of such treaty in the English language. 

1. Claims 1-4,6,9-16,18-23,25,28-34,36-41 & 43 are rejected under 35 
U.S.C. 102(e) as being anticipated by Kanevsky et al. (U.S, Patent 6529871). 

Regarding claims 1&13, Kanevsky et al. disclose a method and system 
that is comprised of a speech-processing system/ 
apparatus/device/software (collection of methods) that resides in the 
memory of a computer system (Co! 5, Line 50 - Col 6, Line 7). 

Kanevsky et al. disclose the building of user databases [claimed PIM 
database] and models by using an enrollment process that collects 
personal information (e.g. name, address, etc) from the user (Col 8, 30 - 
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51 ). The system contains non-acoustic and acoustic models (language 
models) for many users (Col 7, Line 62 and Col 8, Line 64). 

Kanevsky et al. disclose a learning algorithm for building user model that 
include the response to queried information from operators, informational 
forms, email forms, web based forms or Integrated Voice Response (IVR) 
systems (Col 7, Line 64 - Col 8, Line 7). This data is stored together with 
the acoustic models (language models) collected by a speech recognition 
system (Col 8, Line 65). When the user calls the central server to gain 
access, the users acoustic features extracted by an automatic speech 
recognition (ASR) application and the non-acoustic data (personal 
information data) in the database is used to help in the determination of 
the identity of the speaker (Col 6, Lines 12 -56). 



Regarding claim 19, Kanevsky et al. disclose a speech-based processing 
system comprising of a database of PIM data associated with a user (Col 
6, Line 36 - 40); a set of language models (Col 7, Line 60); a language 
model server including a learning unit [learning algorithm] (Col 8, Line 33- 
52), a lookup unit (Col 7, Lines 15-40); a recognition server to recognizer 
utterance of the user (Col 8, Line 13) and a speech application to trigger 
operation of the look-up unit to identify and access a subset of the PIM 
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data specified by tine utterance using the result of recognizing the 
utterance(Col 7, Lines 30 - 40). 



Regarding claim 28, Kanevsky et al. disclose a speech-based processing 
system comprising of a processor means for executing software (Col 5, 
Line 64 - Col 6, Line 2); and a storage means having stored therein a 
learning unit to learn a set of language models based on a set of PIM data 
and a speech application to access a subset of the PIM data specified by 
a short reference to set to subset uttered by a user(Col 7, Lines 30 - 40). 

Regarding claim 30 & 37, Kanevsky et al. disclose a method of facilitating 
speech recognition comprising of using an automated language model 
learning process to acquire a set of language models based on PIM data 
associated with a user (Col 8, Lines 35 - 51); recognizing an utterance by 
the user by using one of the language model(Col 8, Lines 52 -66); and 
using the recognized utterance of the user to identify and access a subset 
of the PIM data(Col 9, Line 2). 

Regarding claims 2,20,31 & 38, Kanevsky et al. disclose the method of 
asking the user random questions [claimed short references] derived from 
the personal database. The data will include portions or a subset of the 
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data of the personal information data residing in the PIM/user database 
(Col 6, Lines 41-45). 

Regarding claims 3,1 5,22,33 & 40, Kanevsky et al. disclose that a new 
user may periodically call the central server where the learning algorithm 
may be activated. The system collects voice samples from the caller's 
answers to the plurality of questions and builds a user voice model which 
will be provided to the system the next time the speaker calls and needs to 
be identified (Col 8, Line 64 - Col 9, Line 5). 

Regarding claims 4,16,23,34 & 41 Kanevsky et al. disclose that a user 
may call the central server where the learning algorithm may be activated 
in real-time to help in the determination of the identity of the user by the 
use of stored language models based on the learning algorithm (Col 7, 
Lines 28 -33). 

Regarding claims 6,18,25,36 & 43 Kanevsky et al. disclose that the 
language model learning algorithm is based on a statistical language 
model. Kanevsky describes that a user model provided by the learning 
unit is used to determine the probability of a particular user's identity (Col 
11, Lines 3 -35). 
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Regarding claims 9, 10, 1 1 ,12,14,29,21 ,32 & 39, Kanevsky et al. disclose 
that the learning algorithm takes data form a plurality of sources, each 
module containing a set of heuristics tailored for acquiring language 
models for one of a plurality of types of PIM data. The data comprises of 
personal address book, personal calendar and email message data. 
Kanevsky et al. describes categories of personal information data. The 
categories include static features which includes phone number, time of 
day, etc (personal address book data), dynamic features which include 
trips, meetings (personal calendar data) and email, faxes, etc. The data 
also includes internal information extracted from dialog that includes 
gender, speech rate, accent etc. and external information such as name, 
address, DOB, etc (Fig 4, Col 10, Lines 42 - 59). 



Claim Rejections - 35 USC § 103 

2. The following is a quotation of 35 U.S.C. 103(a) which forms the basis for 
all obviousness rejections set forth in this Office action: 

(a) A patent may not be obtained though the invention is not identically disclosed or described 
as set forth in section 102 of this title, if the differences between the subject matter sought to 
be patented and the prior art are such that the subject matter as a whole would have been 
obvious at the time the invention was made to a person having ordinary skill in the art to which 
said subject matter pertains. Patentability shall not be negatived by the manner in which the 
invention was made. 




Application/Control Number: 09/813,71 1 Page 7 

Art Unit: 2655 

3. The factual inquiries set forth in Graham v. John Deere Co., 383 U.S. 1 , 
148 USPQ 459 (1966), that are applied for establishing a background for 
determining obviousness under 35 U.S.C. 103(a) are summarized as follows: 

1 . Determining the scope and contents of the prior art. 

2. Ascertaining the differences between the prior art and the claims at 
issue. 

3. Resolving the level of ordinary skill in the pertinent art. 

4. Considering objective evidence present in the application indicating 
obviousness or nonobviousness. 



5. Claims 5,17,24,35 & 42 are rejected under 35 U.S.C. 103(a) as being 
unpatentable over Kanevsky et al. (U.S. Patent 6529871) in view of Kremer 
(IEEE 0-7803-4122-8/97). 

Regarding claims 5,17,24,35 & 42, Kanevsky et al. disclose a method and 
system that is comprised of a speech-processing system/ 
apparatus/device/software (collection of methods) that resides in the 
memory of a computer system (Col 5, Line 50 - Col 6, Line 7). The 
system uses personal data to choose language models base on input 
speech. Kanevsky et al. do not disclose the use of a language model 
algorithm that utilizes grammar induction. However, Kremer teaches a 
grammar induction technique that avoids the limitations of classical 
grammar induction algorithms (Page 1424, Paragraph 1 & 2). Grammar 
induction is a necessary process by which the language models based on 
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a finite set of example strings from the PIM data will be used to identify 
grammar, for a potentially infinite set of strings or language. 

Therefore, it would have been obvious to those skilled in the art at the time 
of the invention to modify Kanevsky et al. with the use of language models 
that utilize grammar induction as taught by Kremer since it is essential for 
language models utilized in automatic speech recognition systems. 

4. Claims 7,8.26,27,44 & 45 are rejected under 35 U.S.C. 103(a) as being 
unpatentable over Kanevsky et al. (U.S. Patent 6529871) in view of Fitzpatrick et 
al. (U.S. Patent Application 20020184033). 

Regarding claims 7,26 & 44, Kanevsky et al. disclose a method and 
system that is comprised of a speech-processing system/ 
apparatus/device/software (collection of methods) that resides in the 
memory of a computer system (Col 5, Line 50 - Col 6, Line 7). The 
system uses personal data to choose language models base on input 
speech. Kanevsky do not disclose the use of an API designed to access 
PIM data. However, Fitzpatrick et al. teach the use of Standard 
Application Program Interface (API) for integrating other Text-to-Speech 
Servers (Page 4, Paragraph 01 16). APIs are important and necessary for 
developers to efficiently integrate large-scale software applications and 
modules. 
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Therefore, it would have been obvious to those skilled in the art at the time 
of the invention to modify Kanevsky et al. with the use of APIs as taught 
by Fitzpatrick et al. since it would have made the integration of the 
learning modules with other applications more efficient. 

Regarding claims 8,27 & 45, Kanevsky do not disclose the use of 
providing the PIM data in XML format. However, Fitzpatrick et al. teach 
the use of providing documents in XML format (Page 6, Paragraph 0175). 
XML is a general and highly flexible representation of any type of data. 
XML uniform tags make it easy to map one XML structure to another or to 
map XML into other data formats. 

Therefore, it would have been obvious to those skilled in the art at the time 
of the invention to modify Kanevsky with the use of XML as taught by 
Fitzpatrick et al. since it would have made the data more uniform so that it 
can be transferred and mapped to other types of data. 

Conclusion 

6. Any response to this action should be mailed to: 

Commissioner of Patents and Trademarks 
Washington, D.C. 20231 
or faxed to: 
(703)872 9314, 



• 
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(for informal or draft communications, please label "PROPOSED" or 



DRAFT") 



Hand-delivered responses should be brought to Crystal Park II, 2121 Crystal Drive, Arlington, VA., 
Sixth Floor (Receptionist). 



Any inquiry concerning this communication or earlier communications from the 
examiner should be directed to Michael Lewis, telephone number (703) 305- 
8730. 

If attempts to reach the examiner by telephone are unsuccessful, the 
examiner's supervisor, Ms. Doris To, can be reached at (703) 305-4827. The 
facsimile phone number for this group is (703) 872-9314. 

Any inquiry of a general nature or relating to the status of this application 
or proceeding should be directed to the Group 2600 receptionist whose 
telephone number is (703) 305-4750, the 2600 Customer Service telephone 
number is (703) 306-0377. 



mal 



12/22/2003 




SUPER*/(SORV PATENT EXAMINER 
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